Motion Analysis by Synthesis: Automatically Annotating Activities in Video

نویسنده

  • Deva Ramanan
چکیده

An important vision problem is to automatically describe what people are doing a sequence of video. This problem is difficult for many reasons. First, tracking the articulations of people is difficult because arms and legs are small and can move quite fast. Secondly, one must sew the estimated poses from each frame together into a coherent motion path; this is difficult because people can move in unpredictable ways. Finally, one must describe the motion with some activity annotation. This appears difficult for everyday motion since there may not be a canonical vocabulary of motions. We describe a fully automatic system that labels the activities of multiple people in a video sequence. The system decouples the choice of annotation vocabulary from the analysis procedure, allowing for easy revision of the vocabulary. The system first uses a pictorial structure model to independently detect 2D poses in each frame. The system then synthesizes 3D motion clips that looks like the 2D motions by matching poses to a stored library of motion capture data. The motion capture data is labeled offline with a class structure that allows for multiple annotations to be composed; one may walk, while waveing, for example. The lack of a canonical vocabulary also makes it difficult to evaluate experimental results. We introduce a mutual information criterion that allows one to evaluate different annotation systems given labeled test footage. We demonstrate and evaluate our system on real sequences of multiple people interacting and commercial shots from a feature-length film.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Traffic Scene Analysis using Hierarchical Sparse Topical Coding

Analyzing motion patterns in traffic videos can be exploited directly to generate high-level descriptions of the video contents. Such descriptions may further be employed in different traffic applications such as traffic phase detection and abnormal event detection. One of the most recent and successful unsupervised methods for complex traffic scene analysis is based on topic models. In this pa...

متن کامل

An Improved Motion Vector Estimation Approach for Video Error Concealment Based on the Video Scene Analysis

In order to enhance the accuracy of the motion vector (MV) estimation and also reduce the error propagation issue during the estimation, in this paper, a new adaptive error concealment (EC) approach is proposed based on the information extracted from the video scene. In this regard, the motion information of the video scene around the degraded MB is first analyzed to estimate the motion type of...

متن کامل

Analysis and Synthesis of Facial Expressions by Feature-Points Tracking and Deformable Model

Face expression recognition is useful for designing new interactive devices offering the possibility of new ways for human to interact with computer systems. In this paper we develop a facial expressions analysis and synthesis system. The analysis part of the system is based on the facial features extracted from facial feature points (FFP) in frontal image sequences. Selected facial feature poi...

متن کامل

MediaBrain: Annotating Videos based on Brain-Computer Interaction

Adding notes to time segments on a video timeline makes it easier to search, find, and playback important segments of the video. Various approaches have been explored to annotate videos (semi) automatically to summarize videos. In this research we investigate the feasibility of implicitly annotating videos based on brain signals retrieved from a Brain-Computer Interface (BCI) headset. The signa...

متن کامل

Summarization of Video - taped Presentations : Automatic Analysis of Motion

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. XX, NO. Y, SEPTEMBER 1998 Summarization of Video-taped Presentations: Automatic Analysis of Motion and Gesture Shanon X. Ju, Michael J. Black, Scott Minneman, Don Kimber Abstract| This paper presents an automatic system for analyzing and annotating video sequences of technical talks. Our method uses a robust motion estimation ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005